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Abstract 

The problem of non-stationarity in financial markets is discussed and related to the dynamic 
nature of price volatility. A new measure is proposed for estimation of the current asset volatility. 
A simple and illustrative explanation is suggested of the emergence of significant serial autocor- 
relations in volatility and squared returns. It is shown that when non-stationarity is eliminated, 
the autocorrelations substantially reduce and become statistically insignificant. The causes of 
non-Gaussian nature of the probability of returns distribution are considered. For both stock and 
currency markets data samples, it is shown that removing the non-stationary component substan- 
tially reduces the kurtosis of distribution, bringing it closer to the Gaussian one. A statistical 
criterion is proposed for controlling the degree of smoothing of the empirical values of volatility. 
The hypothesis of smooth, non-stochastic nature of volatility is put forward, and possible causes 
of volatility shifts are discussed. 

1 Introduction 

Non-stationarity is arguably the most characteristic feature of financial markets. It is generally ac- 
cepted that statistical parameters of price dynamics vary with time. This fact is unpleasant both 
for researchers and practitioners, because any discovered regularities and elaborated trading systems 
quickly lose their relevance as time passes. The best solution to the problem of non-stationarity would 
be to include it into a probabilistic model of market operation. 

One of the most important characteristics of returns of a financial instrument is its volatility. 
There is no doubt that volatility varies over time, and this phenomenon is the subject of voluminous 
literature, for example [1!], as well as a more recent collection in [2f. There are 'quiet' periods of 
market behavior and periods of increased volatility. One can say that volatility characterizes the 
market 'temperature', the degree of its emotional tension. Forecasting future values of volatility is 
extremely important; it plays a crucial role, among other issues, in determining the pricing of options 
and assessing the risk for portfolio investors (see 0| for an extensive review). Understanding the 
causes and nature of non-stationary volatility would also lead to a deeper insight into the essence of 
the financial market processes. Various models were suggested, encompassing such diverse fields as 
theory of chaos applied by and multi- agent systems studied by Q. This task became especially 
relevant in recent years, during the unfolding financial disturbances, as well as dramatic events of 
Internet bubble (relevant discussion can be found in 

The term volatility comprises at least four different meanings: 1) the emotional characteristic of the 
market; 2) sample mean square deviation of logarithmic returns; 3) the 'true' unobservable variance of 
the underlying distribution of returns; and 4) the implied volatility in option contracts. In this paper, 
we use the term volatility in the second and third senses, which are usually referred to as realized 
and latent volatility, respectively. We refer the reader to reviews by and by @ for good overview 
of the field. The choice of robust volatility estimator is important for producing correct inferences 
from the available data (see One of the questions we focus on in our research is, which choice 
of estimator of sample volatility leads to a minimum error for a certain model of a random process. 
'True' volatility is, of course, non-observable, and the question of its nature is further complicated by 
the non-stationary nature of the markets. 
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The generally accepted approach is to consider the volatility as a stochastic variable (see, for 
example, [l(J, [3] and the collection in [HI]). One of the chief motivations for this is the presence of 
high autocorrelations in volatility and squared returns, as discussed in [12]. Compared to the near-zero 
autocorrelations in logarithmic returns, the detection of such long- memory pattern creates striking 
impression. 

In the probabilistic models with variable volatility, the price x(t) random process is described ei- 
ther by discrete or continuous equations, parameters of which are random variables. In this context, 
GARCH(p, q) model first introduced by [13j gained wide popularity, as well as its various generaliza- 
tions (see In this case, the timeline is divided into finite time intervals (lags) of duration r, and 
then only 'closing' prices of these intervals are considered Xk — x(k ■ r), where k = 1, 2... is an integer. 
Logarithmic returns rk — \n{xk/ Xk-i) are independent random variables, with variable volatility <7fc, 
the square of which linearly depends on the previous squared returns and volatilities: 

p <? 

r k = cr k e k , vl=un+^a i o- 2 k _ l +^(3 l rl_ l . (1) 

i=l i=l 

Here and below £k is uncorrelated normalized random (i.i.d.) process with zero mean and unit variance: 
~el = 0, e? = 1, el ■ Ej = 0. A line over a symbol, as usual, denotes the average of all the possible 
realizations of Ei. 

In the continuous framework, the stochastic Ito's equation is widely used for both price and 
volatility dynamics. The price is modeled (for an example, see a paper by [l5j]) by the ordinary 
logarithmic walk, and volatility is described by the Ornstein-Uhlenbeck equation: 

dx 

— = ndt + a(t)SWi, diner = (3 ■ (a-lna) dt + jSW 2 , (2) 
x 



where 8W\, 8W2 are uncorrelated Wiener variables 5W — eydt. Indeed, the term 'stochastic' volatility 
is usually reserved for this class of models, although we here use in a somewhat broader sense, to include 
the GARCH-type models. 

Sometimes, both in the discrete and the continuous models, one or more 'hidden' stochastic vari- 
ables are introduced, and volatility is considered as a function of such variables. Other, sometimes 
rather sophisticated approaches, exist in the literature (see as an example). What unites them 
all is the probabilistic description of the local dynamics of volatility (either discrete or continuous) . 

There is an extensive body of empirical research devoted to tes ting of predictive power of GARCH- 
type stochastic models over the last twenty years, surveyed in [17|], [llj], as well as the discussion 
of correct methodology for forecast estimation In general, certain skepticism regarding the 

predictive capabilities of such models is present in ongoing research. Recently, certain considerations 
were expressed that explain the persistence of autocorrelations of positively determined variables as 
the result of their non-stationarity; [ill and [13] are just a few examples of related research. 

The effect of non-stationarity is also directly related to the problem of searching for the probability 
distribution of returns of financial instrument. It is well known that this distribution is non-Gaussian; 
it has heavy tails and, consequently, manifests high kurtosis and high probability of excessively large 



or small returns. Starting with the seminal work by [21], this fact has gradually become a standard 
in financial engineering (see [22| for a modern view upon the subject). However, most approaches 
to constructing the probability distribution of random variables implicitly suppose their stationarity, 
which we do not observe at real financial markets. 

The idea that non-stationarity in the random process can cause the non-Gaussian behavior of 
returns distribution goes back as far as the classical work by it was therein tested and was not 
confirmed. Nevertheless, the question about the type of distribution and the effect of non-stationarity 
requires further careful consideration. 

In this paper we provide the arguments in support of the hypothesis that volatility a(t) is a smooth, 
rather than stochastic, function of time. The explanation of origin of high long-term autocorrelations 
and the non-Gaussian nature of returns distribution will be given. Our hypothesis also implies that 
the volatility manifests the property of resilience: under the impact of irregular, relatively rare and 
completely unpredictable shocks to the market, it gradually deforms; after such influences cease to 
act, the relaxation process takes over and volatility gradually decreases. 
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The remainder of the paper is organized as follows. First, we discuss a new measure of volatility 
and demonstrate its effectiveness. After that, the empirical stylized facts of autocorrelations associated 
with the volatility are listed, and a simple non-stationary model, in which such properties naturally 
arise, is proposed. A very clear graphic representation is provided for the mechanism of appearance 
of autocorrelations, and a simple mathematical formalism for performing the necessary calculations is 
proposed. 

The evidence that such a mechanism is actually realized in financial markets is provided by cal- 
culation of the autocorrelation function (ACF) for two modifications of original series; namely, the 
autocorrelations are vanishing for both the first differences of volatility of consecutive days, and for 
the residual series obtained by elimination of its smooth part cr(t). The empirical tests of these facts 
are carried out utilizing sample data of both stock market and exchange rate dynamics. 

Next, we show that normalizing the returns series by a(t) leads to significant reduction in kurtosis 
of distribution, in some cases restoring it to the normal form. Statistical criteria for controlling the 
degree of data smoothing are elaborated. We consider the arguments concerning the local constancy of 
'true' volatility. In Conclusion, a number of inferences about the possible properties of the dynamics 
of volatility are formulated. Various technical details are compiled in self-contained Appendices, which 
complement and detail the calculations presented in the body of the paper. 



2 Measurement of volatility 

Historical prices for various financial instruments are usually available as discrete time series, with a 
certain period of time (lag) between consecutive points. Most widely available are data with daily 
lags, hourly lags can be observed less frequently, and minute lags are even more seldom. In addition to 
the closing price C t (the latest value within a lag), other commonly utilized parameters are: opening 
quotes O t (first price of a lag), maximal H t and minimal L t price values. By means of these four price 
points one can construct three independent relative values, which we will call the basis of a lag. 




h t = H t -Ot 
It = Ot-Lt 
n = Ct-Ot 



(3) 



Figure 1: Characteristics of volatility 



The height h of price ascent and the depth of price descent I are both positive values. Out of these 
measures the amplitude of price range a = h + l can be defined (see, for example 24 1 and [25| for early 
examples of its use). The asset return r can be both positive and negative. 

If one considers the models of additive Wiener random walk dx = jj, dt + a SW, the values 
{O t , H t , L t ,C t } are asset prices. For the logarithmic random walk dx/x = jldt + aSW they are 
logarithms of price values \nx. Thus, in the latter case, for example, the range a t would be equal to 
the logarithm of the ratio of maximum price to minimum price at = \n(H t / L t ) , the corresponding r t 
equal to the logarithmic return r t = h\(Ct/Ot), and so on. 

We define volatility a of a lag with duration T as an average of asset return r deviation from the 
mean over a sufficiently large number of lags: a 2 — ((r — r) 2 }. If volatility a is constant, the values of 
positive entities {h,l, \r\,a} in certain sense serve as its measure. The higher is the market volatility, 
the more probable are their high values. In particular, in absence of drift (ji = 0), their men values 
are proportional to volatility: a — 1.596 - a, h = I = \r\ = 0.798 • a (see | Appendix A| . 

However, the informational content of each parameter, and of their possible combinations, varies. 
The distributions of the probability density for P(a) and P(h), P(l), P(\r\), in the case of Wiener 
process, are plotted in Fig. [2] (dotted lines mark the distributions' mean for a = 1). 
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Figure 2: Probability densities for the basis components 

As we see, among the basis components {a, h, I, |r|} only the range a has a sufficiently narrow 
maximum around the mean value. The density of probability of the other three values are strictly 
decreasing functions, which allows for h, I and |r| to take, with high probability, values close to zero. 
The range a, on the contrary, avoids going to zero, the probability that a < 0.75a being as low as 
0.002. Indeed, it often happens that the market closes with a near-zero change in price |r| ~ 0, while 
its volatility during the day was significant. In general, the narrower the distribution of probability 
for volatility measure, the better is this measure. For some positively determined value v, the relative 
degree of distribution narrowness can be characterized by a ratio a v /v, where a 2 = (v — v) 2 is mean 
squared deviation from the mean v. For the range we have a a /(a) — 0.30, which signifies more than 
twice as narrow distribution peak than, for instance, for the height ((Jh/h = 0.76). A natural question 
arises: is there a combination of the basis values f(h, I, r) that has a narrower distribution than the 
price range a? This topic is the subject of extensive research (see e.g. |2ij],|25|], j26j,[23]> (28|). 

In the present article we define a simple, but efficient, modification of the price range, which is 
motivated as follows. If the price dynamics within the lag is accompanied by a significant trend \r\ ^ 
(whether it is going up or down), the volatility may appear lower than for the same price range, but 
in the absence of trend (|r| = 0). Therefore there are good reasons to decrease the value of the range, 
as a measure of volatility, in the case when \r\ is large. We achieve this by introducing the following 
volatility estimator, which we call modified price range (see | Appendix B[ ): 



Its statistical parameters - mean (av), standard deviation (si), skewness (as) and its kurtosis (ex) for 
(7 = 1 are listed in Table [TJ 



Table 1: Statistical parameters of probability distributions for |r|, a and v. 





av 


si 


as 


ex 


si/av 


\r\ 


0.798 


0.603 


1.00 


0.87 


0.76 


a 


1.596 


0.476 


0.97 


1.24 


0.30 


V 


1.197 


0.300 


0.53 


0.26 


0.25 



One can see that the relative width of the distribution of modified range a v /v = 0.25, which is 
better than that of simple range o. The statistical parameters also show that the distribution for v is 
more symmetrical around the maximum and has a lower kurtosis than a. The form of distribution for 
P(v) together with P(a) (dotted line) are plotted in Fig. [3l and there we also provide the expressions 
for the average v and its square for the case of the Brownian walk. Thus, the modified price range 
provides a better measure of volatility than the simple range, and significantly better than absolute 
logarithmic returns. In |Appcndix B| we compare the modified range with several other ways of 
volatility measurement utilized by other authors. Providing for the same or lower error of volatility 
estimation, the measure v has a significantly more simple definition, and is unbiased for the small 
number of lags, so we will use it extensively throughout this paper. 
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Figure 3: Probability densities P(v), P(a) and mean values for v and v 2 

3 Intraday volatility 

We shall demonstrate the effectiveness of modified amplitude of range on the available realized volatil- 
ity data. Here we consider 15-minute quotes at the Forex market for the period from 2004 to 2008 
for EURUSD currency pair. We shall make them aggregated into daily points, calculating, beside 
minimum and maximum meanings, intraday volatility basing on logarithmic returns of 15-minute 
lags: 

n 

^^iE(rr^. (5) 

2=1 

During a day, we have n = 96 = 4 • 24 15-minute lags. Multiplier n in {1} turns 15-minute volatility 
into the daily value. The evolution of intraday volatility is given in the Fig. 3] (data for 1250 trading 
days, excluding weekends and major holidays): 
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Figure 4: Intraday volatility of EUR/USD rate 



One can observe that since the fall of 2008, volatility of the currency market, as well as of other 
financial markets, has increased dramatically, due to the worsening financial crisis. However, even in 
the pre-crisis period, volatility has a clear-cut non-stationary component. 

It is natural to assume that realized volatility obtained from a sample of n = 96 characterizes the 
'true' volatility better than does a daily basis of three values {h, I, r} (see [ijj, [HO], j3l[) even though 
there are various high-frequency effects that one has to take into account (discussed in detail by [321 ] . 
[33T ]. and (34[). To find a more robust measure of volatility, based on the basis, one should look for 
a value stronger correlated with the intraday volatility. Let us chart the scatter plots of dependence 
of daily values of v t , a t and \r t \ on intraday volatility a t (EUR/USD for period 2004-2008, Fig. [5]). 
It can be easily seen that v t and a t are substantially more correlated with <r t , than with \r t \. The 
transition from logarithmic range a to modified range v makes the correlation more pronounced, but 
the difference is not significant. 

Similar results are observed for other currencies. The slope of regression lines v t ja t and at /at for 
six currency pairs are given in Table [2l In each case the error of linear approximation for v is lower 
than for a, and significantly lower than for \r\. 
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Figure 5: Dependencies of v(a), a(a) and \r\(a) 



Table 2: Slope of regression lines v t /a t and a t /a t for six pairs of currencies 





eurusd 


gbpusd 


usdchf 


usdjpy 


usdcad 


audusd 


average 


(v/a) 


1.263 


1.260 


1.289 


1.251 


1.241 


1.243 


1.258 


(a/a) 


1.671 


1.665 


1.692 


1.640 


1.621 


1.660 


1.658 


{\r\/a) 


0.817 


0.809 


0.807 


0.776 


0.761 


0.834 


0.801 



Despite the noticeable variation, the values of v/a, a/ 'a and \r\/a are close to their theoretical 
values for Wiener random walk, 1.197, 1.596 and 0.798, respectively. Nevertheless, we must keep in 
mind that, for example, the expression v/a = 3/y/(2Tr) holds only for the Brownian random walk 
with normal distribution of returns. In reality, this condition is not fully satisfied, so the ratio v/a 
may be equal to some constant different from 3/ y^n), and its exact value we will discuss below. 

Another indication of significance of modified price range are autocorrelation coefficients that will 
now be the object of our interest: 



p a (v) = cor(v t ,v t -s) 



((v t - v)(v t - s - v)) 



(6) 



where the averaging is carried out for all the observed values of v t = Vi,..,v n . For interdaily rates 
of EUR/USD (2004-2008) we obtain (as shown in Fig. [6]) the autocorrelation charts as a function of 
shift (in days) parameter s. As can be observed from Figure [6l autocorrelations of intraday volatility 
Pi(a) = 0.77 are the highest, followed by modified price ranges pi(v) = 0.54, then by simple range 
pi(a) — 0.47, and the weakest correlations are those of absolute logarithmic returns pi(|r|) = 0.11. 
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Figure 6: Correlograms of volatility for EURUSD 



High autocorrelations appear for a variety of financial instruments and are quite an intriguing fact 
( [35| provides the list of other so-called stylized facts, as well as an excellent compilation of references 
to relevant research). In contrast, the first autocorrelation coefficient of EUR/USD rate returns is 
equal to pi(r) = —0.02, which corresponds to the absence of correlation, if one takes into account 
that 2a rule gives an error band of 0.06 (for 1250 trading days). This unpredictability of the market 
returns is one of manifestations of its market effectiveness. 

However, the situation is quite different for absolute returns, and even more so for volatilities, 
which have slowly decaying long-range ACF function. Basing on this fact a huge number of stochastic 
models have been constructed, which claim the ability to predict the future values of volatility (see 
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[8] for a recent review). The majority of these models have empirical nature, and do not explain the 
causes of high autocorrelations. One of our tasks in the present paper will be to propose such an 
explanation 

4 Empirical features of autocorrelations 

We now extend our analysis by outlining a number of features of autocorrelation coefficients pertinent 
for volatility. 

1. Autocorrelations decay monotonically and very slowly. 

This is a well-known result (see [36], [37]). A number of papers were devoted to attempts on deter- 
mining the functional dependence of autocorrelation coefficients from the shift parameter s. Usually, 
autocorrelations are approximated by a power law s~ M , where the parameter [i turns out to be small. 

2. The longer is the time interval, the higher are autocorrelations. 

Let us consider the behavior of ACF for daily modified range v — a — \r\/2 for S&P500 stock index for 
the period from 2001 to 2006. We split this interval into two three-year periods, namely, from 2001 
to 2003 and from 2004 to 2006. During the first one there were n — 752 trading days, while during 
the second - n — 755. We calculate autocorrelation coefficients separately for each period, as well as 
the autocorrelation of combined data series. 

The resulting autocorrelograms are represented in Fig. [7] (the combined ACF is repeated on both 
plots). As can be noticed, the summarized correlogram goes above the correlograms of each period. 
However, this behavior is not observed for any asset in any circumstances, and the conditions that 
are required for this to occur will be clarified during further discussion below. 
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Figure 7: Correlograms of S&P500 for different time periods 



Here and below the dotted horizontal lines in the correlograms mark the double standard error 
band ±2/-\/(n), where n is the number of points involved into the calculation. Table [3] shows the 
main statistical parameters of daily logarithmic returns of S&P500 index for different periods. In 



Table 3: Main statistical parameters of daily logarithmic returns of S&P500 index 



Period 


n 


r 


a 


as 


ex 


Po,% 


Pi,% 


pi(v) 


2004-2006 


755 


0.032 


0.659 


-0.02 


0.25 


55.9 


69.4 


0.16 


2001-2003 


752 


-0.023 


1.376 


0.20 


1.27 


48.9 


71.4 


0.42 


2001-2006 


1507 


0.005 


1.078 


0.15 


2.84 


52.4 


75.7 


0.55 



addition to the mean (r), daily volatility er, skewness(as) and kurtosis(ea;), we also present here the 
percentage of positive returns po = p{r > 0) and a share of returns falling within one sigma of the 
mean: p\ = p(\r — f| < a). 

Table[3]illustrates the fact that when the market is calm (2004-2006: a = 0.659%), the distribution 
of asset returns is close to normal (ex = 0.25). However, the normality deteriorates significantly after 
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Table 4: Volatility autocorrelation coefficients for EUR/USD exchange rate, with and without 2008Q4 
data 

Period n pi(a) pi(v) pi (a) MM) 

2004Q1 .. 2008Q4 1302 0.80 0.54 0.47 0.11 
2004Q1 .. 2008Q3 1215 0.51 0.25 0.16 0.01 



we extend the time interval under consideration. Simultaneously the autocorrelation of volatilities 
starts to increase pi(v) = cor{vt,Vt-i). 

A similar situation can be observed in the foreign exchange market. Discarding the data from 
recessionary fourth quarter of 2008 reduces significantly the autocorrelation coefficients of data series 
related to the volatility of EUR/USD pair, as illustrated in Table 2) We note that dropping the 
2008Q4 data reduces the number of days for which the autocorrelation coefficients are calculated by 
merely 7%. 

3. Scatter plot of volatility has a 'comet-like' shape. 

Let us consider the scatter plots for modified range parameter {vt-i, Vt}, illustrating the 'existence 
of memory' of volatility for the three periods of S&P500 index, discussed above. As can be seen from 




Figure 8: Dependence diagrams a t (a t _i), S&P500 



Fig-El data points fill the region of a distinctive 'comet-like' shape, its tail fanning out into the positive 
values of both axes. Naturally, the higher the autocorrelation coefficients are, the more distinctive is 
the form of the dot cloud. 

The shape of region at = f(a t - s ) is completely independent of the shift s and the utilized volatility 
measure. For EUR/USD currency pair over 2004-2008 period, we have the scatter plots of intraday 
volatilities, obtained from 15-minute lags, are presented in Fig. [9) There, three values of the shift 
are presented: one day (s = 1), one week (s = 5), and two weeks (s — 10). It can be seen that the 
form of 'comet-like' shape doesn't change qualitatively, but rather spreads out gradually along with 
the decrease of autocorrelation coefficient. 
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Figure 9: Dependencies a t (a t - s ), s = 1,5, 10, EURUSD 
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5 When autocorrelations do not decay 



Actually, slowly decreasing autocorrelation coefficients as a function of the shift parameter, ought to 
be a cause for alert. There are very simple models that exhibit similar long-correlations effects without 
employing the notion of stochastic volatility (see [19J for one ingenuous example) . 
Let us consider, for example, an ordinary logarithmic walk: 

— = n dt + a SW. (7) 

x 

and simulate 20 years (5000=20-250 trading days) of price evolution; volatility is defined as constant 
equal to or = 1% for the first 10 years, and changes to another constant value of a% = 2% in the second 
half of the period. Wiener's process is represented as SW = ey/dt, where e is normally distributed 
random variable with zero mean and unit variance. We choose one second dt = 1/(24 • 60 • 60) as a 
small time interval dt. 

The dynamics of daily values of the modified price range v t — a t ~ \rt\/2 during 'critical' 10th and 
11th years has the shape plotted in Fig. [10] (where time is in 'days'). 




Figure 10: Two years of random walk around the 'switch' in volatility 



Such data series with a one-time shock non-stationarity exhibits noticeable autocorrelation coeffi- 
cients for the absolute returns (plotted in the second panel of Fig. [11]), and even higher autocorrelations 
for the price range (third panel). 
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Figure 11: Autocorrelations of returns, absolute returns and modified price range. 



The decay of ACF is very slow with the increase of shift parameter s. In contrast to \r t \ and Vt, 
the correlations of price returns r t (the first plot above) lie within two standard errors, and thus are 
practically absent. 

Therefore, correlation regularities arise in the considered toy model, despite the statistical inde- 
pendence of the two consecutive days. We stress that not only the returns r are independent, but 
so are the absolute returns |r|, and amplitudes of price v. If volatility were constant for the whole 
modeled period, all the correlograms p s (|?i) an d Ps(v) would be equal to zero. It is when we introduce 
non-stationarity that the picture is qualitatively changed. 

The cause of this effect can be easily understood. Fig.[TT]contains three scatter plots that represent 
values of logarithmic returns, their absolute values and modified price ranges of two consecutive days 
during the first decade of evolution with constant volatility er = 1%. In the first plot, the dots form 
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an almost symmetrical cloud, and the correlation is evidently equal to zero. In the second and third 
plots, there is symmetry is reduced, in agreement with the corresponding symmetry features of the 
probability densities P(|r|) and P(v). However, due to the independence of consecutive days, the 
correlation coefficient is equal to zero. For example, if x — Vt, and y — Vt-i, the independence 
means that the joint density of distribution is equal to the product of probability densities P(x , y) = 
P(x) ■ P(y). Therefore, for any distribution the covariance will be equal to zero: (x — x)(y — y) = 0. 

It is important to emphasize the fact that for the returns r t the center of the data cloud is located 
at the origin of coordinates, whereas for the positively determined values \r t \ and v t it is displaced to 
the right and up to the region of positive values. 

Now let us add the dots corresponding to the data of the second decade to the diagram (see 
Fig. f]~3)) . For logarithmic returns (first diagram) two clouds with the same center f — overlay. The 




Figure 13: The complete data of the model 



resulting cloud remains symmetrical, that is why the autocorrelation is still equal to zero. In the case 
of the price range (third diagram), there are two non-concentric clouds, one of which corresponds to 
(7 1 = 1% and second one to 02 — 2% (we remind that v = 1.197<r). The overlapping area between the 
clouds dithers, and a figure of a characteristic comet-like shape appears as a result (the upper cloud 
is larger). Using the least-squares criterion, one can draw a line through it, the slope which will be 
proportional to the correlation coefficient. 

The shape of diagrams do not change if we plot the data for the case of two days' shift {vt, «t-2}- 
Indeed, with the exception of few transitional points around the volatility jump, all the data for each 
decade will still be clustered in its cloud. 

The situation with the second diagram for the absolute returns {|r t _i|, \r t \} is somewhat more 
complicated. Visually, it is not qualitatively different from the corresponding one for the first decade; 
nevertheless, the non-zero correlation is present. In order to understand this phenomenon, it is 
necessary to extend the standard statistical relations to the case of non-stationary data. 



6 Non-stationary statistics 

Let the distribution parameters of a random variable x vary smoothly with time. If we calculate the 
mean of x over a given time interval T without taking the above mentioned statement into account, 
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we will actually obtain the following expression for x: 

T oo 

*-«0>-i /««)«, — *(.)-/.-P(x,.)«fe (8) 

-oo 

In other words, in every fixed moment of time we calculate a local mean x(t), and then average all 
such local mean values over the time interval T (denoted by angle brackets). Similarly, let us define 
local variance as: 

oo 

— OO 

The variance calculated on all dataset will be equal to: 

^ = jx~^w = <^(*)> - <^)) 2 = (°{t?) + (m 2 ) (m) 2 , (io) 

where the angle brackets, as above, denote averaging over time interval T. Thus, a 2 is made up of 
two distinct parts, namely, it is a sum of weighted local variance (a 2 (t)) and time variance of mean 
(second and third terms in equation l[T0|) ). 

In the special case of parametric non-stationarity, Xt can be represented as Xt = n(t) + a(t) • ijt, 
where r\ t represents stationary independent random process with zero mean and unit variance (rj = 0, 
i] 2 = 1). The mean value of xt is equal to (//(£)), and variance is given by: (cr(t) 2 ) + (/i(t) 2 ) — (/*(£)) . 

Let us now consider two locally independent variables x and y. Their independence means that 
the density of joint probability in any fixed moment of time t decomposes into product P(x,y,t) = 
P(x,t) ■ P(y,t), and 

jTy(t) = x(t)-y(t)- (11) 

However, when averaged over all data, these variables cease being independent. Indeed, the time mean 
of the product x ■ y: 

T oo T 

x^y = ^J J x-y P(x, y, t) dxdydt = x{t)y{t) dt = (x(t) ■ y{t)) , (12) 

-oo 

and this expression is not equal to the product of time means: x-y = (x(t)) ■ (y(t)). In general, if local 
means x(t), y{t) are non-zero, the correlation coefficient is non-zero as well. As we observed from the 
example of the previous section, the mean of returns in each decade was equal to zero, that is why the 
autocorrelation did not arise for r. In contrast, for the positively determined variables \r\ and a the 
mean is non-zero, and autocorrelation is present, despite the independence of two consecutive days. 

Thus, locally independent variables that have similar long-term non-stationarity, become depen- 
dent when we take into account their evolution in time. However, such dependence does not have 
stochastic nature, but rather 'deterministic', smooth one, related to time synchronization. 

For example, if the non-stationarity of volatility has a shape of a step-function with equal duration 
of both periods, the mean and variance of the whole dataset are equal to: 

_ Xi +X 2 2 ^1+^2 | (^l-^2) 2 ,,„s 

X = —2~ ' ° = 2 + 4 ' (13) 

where x replaced either \r\ or v, and statistical parameters of the first and second decades are given by 
x\, a x i, and X2, o~ x i- If the shift in the calculation of the autocorrelation coefficient is small compared 
to the length of T, in the first approximation one can neglect the boundary effects, and assume that 
x = vt and y — V(t — 1) are independent. Their covariance is equal to: 

xiyi + x 2 y~2 X1 + X2 2/1 + 2/2 (xi - x 2 )(yi - y 2 ) 

x-y — x-y = • = . (14) 

w y 2 2 2 4 J 

As Xi — y~i, we receive for the autocorrelation coefficient: 

, s (^1 - ^2) 2 

cor(x,y) — — —^5 s-r-. (15) 

1 iy > (x 1 -x 2 ) 2 + 2(a 2 xl +a 2 x2 ) V ; 
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We can see that such 'correlation' for non-stationary data appears only for variables with different 
means. For v and |r|, mean and variance are proportional to volatility of logarithmic returns x = aa, 
a x = [3a. For absolute returns, a — 0.795, (3 — 0.605, while for the modified price range a = 1.197, 
(3 = 0.300. If volatility of the market changes, so do the mean values of positively determined 
variables v and |r|, For our model, the volatility of logarithmic return is increased by factor of 2, and 
the corresponding correlation is given by 1/(1 + 10(/3/a) 2 ). For absolute returns this is equal to 0.15, 
and for the modified range, 0.61. This agrees exactly with what we have observed in the numerical 
experiment. 

In general case, in order to obtain the autocorrelation coefficients as function of shift s, we should 
use their definition as sums. However, the presentation in continuous time is more compact. Let us 
assume T and shift s to be continuous variables. In case of n lags with duration of r each we have 
T = tit, and s — kr, where k <C n. Let us consider a positively determined variable at, related to 
volatility, which is modulated by a non-stationary component at — a(t) ■ 9 t , where 0t is a stationary 
random variable with a unit mean. For example, for the modified amplitude, 0t — VtV2ir/3. Since 
random variables 6 t at different times are non-correlated and positively determined, we obtain that 
0t ■ @t-s is equal to 1 for s ^ 0, and to ff 1 for s = 0. Let us define the covariance for the case s ^ as 
follows: 

t r T i 2 

1 r 1 r 

(16) 



7 S (cr) = (a t ■ a t - s ) - (a t ) = — _ - / a(t)a(t - s) dt 



- J a{t) dt 



We note that this is not the only possibility in the case of a finite sample with duration T. In any 
case, we require that the covariance (fT6|l is equal to zero if a(t) = const. The variance of a positive 
variable a t equals to 70 = 2 (cr 2 (t)) — (a(t)) 2 . Accordingly, the autocorrelation coefficient p s — j s /lo 
allows to find the shift parameter dependence for different forms of non-stationarity. 

We thus see that autocorrelations of various measures of volatility can arise due to smooth non- 
stationarity in data, rather than because of the stochastic nature of volatility. At this point, a natural 
question comes up: does such a mechanism represent the reason why noticeable autocorrelations of 
volatility are observed in various financial markets? 



7 Autocorrelation of differences 

The easiest way to eliminate the relatively smooth non-stationarities in a time series is to switch to the 
differences of the data series. If v t undergoes a locally constant drift, autocorrelations for this process 
are present. If one considers the differences of two consecutive data points, the drift is effectively 
cancelled. Even if the trend in v t slowly changes its direction, within the ascending and descending 
parts the values of differences change only slightly and become locally quasi-stationary. 
Let us shall consider the change in the modified price range: 

6v t = v t -v t -i. (17) 

Our data sample is represented by daily statistics on S&P500 stock index for the period of 1990-2008 
(4791 trading days), and daily data on EURUSD exchange rate (1999-2008, 2495 days, excluding 
holidays) . Let us start with obtaining the autocorrelation coefficients of the amplitudes of daily price 
range p s (v) — cor(v t , v t - s ), with result plotted in Fig.HH As usual, the coefficients p s are considerably 
high; the autocorrelations for S&P500 index are more significant than those for EUR/USD exchange 
rate, and manifest weaker fluctuations. 

Let us now consider the differences the price range of two consecutive days; we find that for differ- 
entiated series, the autocorrelation p s {Sv) drops sharply, as can be seen from Fig. |T5l The dissimilarity 
between these two behaviors is striking. The second autocorrelation coefficient for S&P500 index is 
reduced by factor of 24, from the value of 0.618 to 0.026. For the EUR/USD rate the decline is 
17-fold - from 0.449 to 0.027. Dotted lines in all figures indicate the double standard error, equal 
to 0.03 = 2/V4791 for S&P500 index and 0.04 = 2/V2495 for EUR/USD. The disappearance of 
correlation can be manifestly demonstrated by means of the scatter plots of consecutive values of v t 
and V(ts). 
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Figure 14: S&P500 and EUR/USD correlograms of the daily of price range 
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Figure 15: S&P500 and EURUSD correlograms after for differentiated price range 



The two diagrams plotted in Fig. [16] clearly show the presence of correlations between {vt,vt-i} 
of S&P500 index, and their absence for {5v t , Sv t -2}- In the left chart dots fill the area with a 
characteristic comet-like shape, while in the right one they form a symmetrical cloud of zero correlation. 
The similar results, with autocorrelation coefficients being equal to zero, are also obtained for absolute 
logarithmic returns |r t |, as well as for other financial instruments. 




Figure 16: First aurocorrelation for S&P500 before and after switch to differences 



We note, however, that for the differences Svt a high negative autocorrelation appears for a shift 
of one day, pi{Sv) — cor(5v t , 5v t -\). In the above example it is equal to -0.49 for S&P500 index and 
-0.53 for EUR/USD. However, its origin is not due to the stochastic dynamics of volatility, but rather 
caused by the overlapping effect. We now elucidate it by way of example. Let us assume that the 
following simple model governs the price process: 

v t =a-6 t , (18) 

where a = const, and 9t are stationary independent positive random variable that arises because of 
the errors caused by the finiteness of the sample that is used for volatility measurement. In this case, 
the differences Sv t — a ■ (6 t — 9t-i) have zero mean 5v t — 0. The first autocorrelation coefficient equals 
to 



(Sv t ■ Svt-i) - <J 2 ((0t ~ Ot-i) ■ (0t-i - 6t-2)) = -o- 2 



2 



a 2 -al (19) 
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where o~\ is the variance of 9. The mean of square arises in the term — {8 t -\ ■ 6t-i) = 2 , which is the 
one responsible for the effect of overlap. In the same way, the variance of difference (Svf) = 2a 2 a 2 
is obtained. Thus, first autocorrelation coefficient is exactly equal to pi(Sv) = —0.5, as we have seen 
above. Correlations with shifts of s > 1 will be equal to zero, because there is no overlap in this case. 

The fact that for the autocorrelations of differences the relations pi(Sv) — —0.5 and p s (Sv) = 
(s > 1) hold with a good degree of accuracy corroborates the model (fT8| . However, if the parameter a 
were a constant, there would be no correlation between consecutive values of volatility p s (v) — (due 
to the independence of 9 t ). The correlation may occur, as we have shown above, as a consequence of 
gradual change in a over time. Therefore, actually a = a{i) is a smooth function of time. 

Both for the conclusive clarification of the situation with pi(5v), and for the purposes of further 
research, we need a method for extracting of the smooth non-stationary component of volatility. 



8 Filtering smooth non-stationarity 

For the extraction of slowly varying component in the process Xk — x(tk) we will use the Hodrik- 
Prescott filter [HI (referred to as HP-filter below). The smooth component Sk of the series can be 
found by way of minimizing the squares of its deviations from empirical data Xk, along with the 
requirement of curvature minimality for Sk : 



(xfc-s fc ) 2 + A-^(V 2 s fe ) 2 = min, (20) 



k=l k=1 



where the second difference is given by V 2 Sfc = (s^+i — Sfc) — (sfe — Sfc-i). The higher A parameter is, 
the more smooth shape Sk one receives as the result. The value of A can vary in a very wide range, 
so it is convenient to use it's decimal logarithm v instead, so that A = 10". 

When one deals with heavily noisy data, there is always certain freedom in the choice of A param- 
eter. If A is small, there is a danger of detecting bogus non-stationarity where it does not exist. With 
little smoothing, s& component will follow any local fluctuations, which do not have any relation to 
non-stationarity. On the other hand, with strong smoothing we risk missing important details of the 
process dynamics that is the focus of our interest. 

Therefore, we need a certain statistical criteria of the degree of smoothing in order to reduce the 
possible arbitrariness. As usual, we will use the random walk as the yardstick. 

The mean value of logarithmic returns is equal to the relative change in price within the time lag 
r t = In C t /O t . We measure the volatility basing on a smoothed mean of modified range within a lag 
a(t) = (a — \r\/2) ■ \/2~jt/3. Here, when using the term 'volatility', we always assume volatility of a lag 
(whether it is minute, hour, day, etc.). 

If the number of discrete price ticks within a lag is sufficiently large, then regardless of the intra-lag 
distribution, logarithmic returns r t will be uncorrelated Gaussian random numbers. Let us smooth 
their mean value f(t) using the HP-filter with different parameters A and calculate the typical value 
of Err[f(t)] for fluctuations f(t) around the average (r) for all empirical data: 



Err[f(t)] = y/((f(t)~r) 2 }. (21) 

Similarly, we determine the error of calculation of smoothed volatility of a lag. Our numerical simu- 
lations show that these errors, with a good degree of accuracy, decrease as the A parameter grows, as 
follows: 

_ r / \ "i 0.50 a ,_, r . .-, 0.15 a , . 

Err[f(t)] « ^ Tyr , Err[a(t)] » ^ rjr , (22) 

and manifest no noticeable dependency on the number of empirical points n. Moreover, the errors do 
not depend on the type of distribution (for a discrete model of random walk) . The rather small power 
exponent of 1/8 clarifies the reason why one needs to vary A parameter over a wide range of values. 

The expressions (|22|) define a typical corridor of oscillations for the smoothed variables f(t) and 
cr(t), which are fluctuations and are not statistically significant for constant volatility. Therefore, 
we use them as criteria of statistical significance, at least for the sections of data where a{t) is 
approximately constant. 



14 



Let us consider a typical example of a numerical simulation (a = 1, n = 1000) for the three values 
of A (y — log 10 A). The boldest line in Fig.[17]corresponds to A = 1000000 (y — 6), and the thinner one 
to A = 1000 [y = 3). The solid horizontal 'significance levels' define the double error band ±2Err[f(t)] 
in case of v — 6, and similar dotted lines, for v = 3 and v = 9. In contrast to significance levels of 
correlation coefficients, we have a smooth variable f(t), which may for some time dwell outside the 
band defined by the statistical error. Nevertheless, the relations f22|) indeed characterize the behavior 
of typical fluctuations of a smoothed variable for random data. 
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Figure 17: Smoothed mean of Gaussian noise 



However, in the non-stationary situation, which is a matter of our main interest, we should keep 
the smoothing factor on the balance. For example, if we model the process a(t) = 1 + 0.5 ■ sin(27rf/T), 
where T is the total duration of the simulated data series, we get the following behaviors of volatility 
smoothing (where volatility is measured by way of modified price range) . 




Figure 18: Smoothed volatility of random walk with a(t) = 1 + 0.5 • sm(2nt/T) 



One can see from Fig. [18] that in this case the optimal value is v — 6, as v = 3 follows too closely 
the noisy fluctuations around the true volatility, while v = 9 simply does not 'catch' the periodic 
nature of a(t). However, the situation deteriorates dramatically, if volatility suffers a shock jump. 
Thus, let us consider the process, where for half of n = 1000 'trading days' the volatility is a = 1%, 
and for the second half a = 2%. For this model, HP-smoothing with different A gives the results 
plotted in Fig. Q21 



\a(t) 




















Figure 19: Smoothed volatility of process with step-constant a(t) 



We see that in this case the choice of v = 6 blurs the step significantly. On the other hand, 
smoothing with v = 3 approximates the jump in volatility much better, but produces noisy and 
spurious fluctuations for constant a. 
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9 Autocorrelation of normalized volatility 



Let us now use the HP-filter to separate the smooth non-stationary part of volatility and filter it out 
from the data. We will focus on the higher-frequency component of volatility that remain after such 
filter is applied, as well as on the corresponding autocorrelation coefficients. 

Let us consider the daily modified price range v t = a t — |rt|/2 for EUR/USD exchange rate for the 
period from 1999 to 2008. Using this empirical data, we now estimate daily volatility a t — VtV2n/3 
and plot it in Fig. [2Ql 
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Figure 20: Volatility of EUR/USD measured by price range 

We extract the non-stationarity from the price process using HP-filter. The bold line at the chart 
below represents the volatility smoothed with A = 1000000 {y = 6). The double error band, according 
with the equation l(22|) . for the value of volatility of 0.5 (the average for years 2004-2007), will have 
the width of ±0.026. In fact, it is only slightly wider than the width of the line. Therefore, the curves 
in the graph of non-stationary volatility a(t) for v — 6 can be regarded as statistically significant (see 

Fig. eh 
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Figure 21: Smoothing the volatility with HP-filter 

The picture changes when smoothing is performed with v = 5 parameter. Let us take the graph 
for <r(t) smoothed with v = 6, and plot the double error band (1 ± 0.036) • a(t) around it (marked 
by dotted lines), which corresponds to the significance levels for v = 5. As can be observed from the 
chart, the v = 5 smoothed volatility (thin line) is more curvy than the one for v = 6; however, all 
the bends of the graph lie within the double-error corridor, and thus one could assume they are not 
statistically significant. On the other hand, the v = 5 smoothed volatility models noticeably better 
the behavior of the empirical data around the shock point in fall of 2008. 

As can be seen from the previous section, the HP-filter keeps the curvature of the whole curve as 
low and as constant, as possible. Therefore, it gives good results for relatively quiet intervals, while 
producing larger distortion when the process goes through abrupt changes. 

Now we proceed to eliminate the smooth trend a(t) from the data. We do this not by subtracting 
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it, as is common practice in the time series processing, but rather divide by it: 



a(ty 



(23) 



The meaning of this procedure is clear; it ensures that the volatility is normalized for the entire data 
series. As a result of this procedure, the volatilities adjust not only their average, equal to 1, but also 
their variance, as can be readily seen from Fig.[22l Let us now compare the autocorrelation coefficients 




Figure 22: Price ranges after normalization 



before the normalization procedure (j23l) is applied (Fig.[23l left), and after it is applied (Fig.[23l center 
and right). As can be seen, the normalization reduces autocorrelations by nearly 10-fold. The same is 
true for the first correlation coefficient, which for the price range differences is equal to -0.50. Thus, 
its origin is indeed related to the effect of overlap discussed above. 
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Figure 23: Autocorrelations before and after normalization 



We note that during the normalization procedure we divide all daily amplitudes by the smoothed 
variable a(t). However, when we calculate it, we use a set of values at at and around the current time 
t. As a result, the neighboring values of <j{t) could appear significantly correlated. This may lead to 
small autocorrelation present after normalization; nevertheless, the value of p s (v/v(t)) is very small. 

Thus, both simple transition to first differences of the data series, and removal of the smooth 
component of volatility by means of HP-filter, make correlation coefficients of the adjusted process 
statistically insignificant. This fact, combined with the discussed above simple explanation for the 
origin of autocorrelation under non-stationarity, raises doubts about the stochastic nature of volatility. 
However, one still needs to explore in more depth the noisy component of the volatility. We will return 
to this issue in the last section of the paper. 



10 Back to normal distribution 

As was already mentioned in the Introduction, there is a large body of research that study the prob- 
ability distribution of logarithmic returns. The fact of its being non-Gaussian has become generally 
accepted (see, for example [13], [3§|). However, when we speak of the density of probability as func- 
tion of single variable P(r), we obviously assume the stationarity of random numbers r, as we do not 
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involve time dependency. To obtain sufficiently reliable statistical results when inferring P(r), one 
chooses the widest possible interval containing a large amount of data points n. 

However, under non-stationarity such approach significantly distorts the 'true' type of distribution. 
If statistical parameters depend on time, the density of distribution will not be stationary either P(r, t). 
Let us assume that the non-stationarity is parametric and concentrated only in the volatility a(t). 
Suppose also that P(r, t) — P(r, a(t)) is governed by the Gaussian distribution (r t = a(t) ■ e t ): 

P{r,t) = -± 7 =e-^*/^). (24) 
a(t)\'2lT 

Second and forth moments are equal to, respectively: r 2 = (cr 2 (i)) , r 4 = 3 (<r 4 (i)), and in general case, 
despite the Gaussian distribution, its 'aggregated' kurtosis, estimated without taking into account the 
non-stationarity, becomes different from zero: 



ex = 3 • 



(* A (t)) 



(25) 



In our toy model of a 20-year walk with shock volatility doubling, the kurtosis of data equals to 
27/25 = 1.08. In a more general case, the non-Gaussian nature may be affected by other types of 
non-stationarity, for example, the drift of returns: r t = (x(t) + <r(t) ■ e t . 

Let us see what happens with the empirical data after eliminating of the non-stationarity. In order 
to do this we divide all r t by the value of volatility at a given moment of time. We obtain its current 
value by smoothing daily modified amplitudes of range at = (at — |?"t|/2)\/27r/3 using the HP-filter. 
Thus, we apply the following transformation to initial logarithmic returns: 

r ^ r 't = WY (26) 

Such normalization makes random numbers r' t , modulated by a(t) function, stationary. 

Table [5] contains statistical parameters of S&P500 index logarithmic returns for the period 1990- 
2008. The total number of trading days is equal to n — 4791, the share of positive returns is 52.8% 
for all cases. 



Table 5: Statistical parameters of S&P500 index logarithmic returns for three different degrees of 
smoothness parameter v 
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0.055 


1.187 


-0.12 


0.82 


70.4 
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= 4 


0.059 


1.177 


-0.08 


0.51 


69.5 



The first line presents the statistics before the transformation of normalization f26|) . The other 
lines contain statistic parameters after transformation, where smoothing with differing parameter 
v = log 10 A is used. 

Special attention should be paid to the columns excess and p\. We see that smoothing reduces 
drastically the values of these parameters. This is true even for a sufficiently smooth function a(t), 
corresponding to v — 6. In Fig. [24] it is represented by the bold line: 

The smaller parameter v is, the more intensive the flections of volatility a(t) are, because the 
fluctuations of returns start affecting the average. Obviously, in this case a decrease in kurtosis 
takes place, even for stationary non-Gaussian random process. To control this effect, we perform the 
following simulation experiment. We randomly mix the initial pairs of daily returns and volatility 
{r tl at\ in order to eliminate non-stationarity. After that, we apply smoothing with HP-filter, and 
normalization (|26|) both to initial data (original), and to mixed ones (mixed). The charts in Fig. [25] 
present the dependence of the kurtosis (left) and the probability of the fact that returns fall within one 
sigma pi (right) as functions of the smoothing parameter v = log 10 A. It can be easily noticed that to 
the right of v ~ 6 the kurtosis and probability p\ for mixed data decrease insignificantly. At the same 
time, statistical parameters characterizing the non-Gaussian property of initial data decrease rapidly. 
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Figure 25: Kurtosis ex and probability p\ for different v, 1990-2008 



Thus, as the criterion for the optimal meaning of is, one may choose the point where the difference 
between the statistics of mixed and initial data reaches its maximum. 

Another argument for importance of non-stationarity contribution into the non-Gaussian property 
of distribution is the break out of 2008 financial crisis. As can be seen from Table OJ the kurtosis over 
the period 1990-2008 is equal to ex — 10.2. However, it is enough to eliminate just one volatile year 
of 2008, in order to make the kurtosis decrease threefold to ex = 3.8. The number of trading days for 
this calculation is reduced in this case by only 5% to n =4528. 




Figure 26: Kurtosis ex and probability pi for different S&P500, 1990-2007 

The charts in Fig. [26] depict the dependency of kurtosis and probability p\ on the smoothing 
parameter v for mixed and initial data of S&P500 index daily returns for the period 1990-2007. One 
can notice that, although the initial value of kurtosis is relatively small, it nevertheless decreases 
statistically significantly as a result of elimination of non-stationarity from the data. For normalized 
data, the value of kurtosis ex — 1 can be considered as significant, which is four times smaller than 
for initial data. 

Let us plot (see Fig. l2"7j) the histograms of probability density distribution and a graph of normal 
probability (in a way similar to [Hj]), formally based on the initial non-stationary data, as well as the 
same quantities after the normalization procedure (|26| is applied to the data (Fig. [28)1 . 
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Figure 27: Distribution of S&P500 returns 1990-2008 
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Figure 28: Distribution of S&P500 returns for 1990-2008 after normalization 

The unmarked line in the charts corresponds to the Gaussian distribution. A graph of normal 
probability represents dependency y = f(r) of relation F^(y) — F(r), where Fjsi(y) is an integral 
normal distribution, and F(r) is empirical integral distribution for returns. If the empirical distribution 
of F(r) is Gaussian, this graph should be a straight line. We see that after normalization the density 
of probability becomes much more close to normal. Deviations from the straight line are particularly 
evident for the excessively large negative returns because of the rare negative shock impacts to the 
market. 

Let us consider, for comparison, the probability distribution of currency market daily returns 
using the EUR/USD rate for the period 1999-2008 as sample. Basic statistical parameters before 
normalization (first line) and after smoothing with different parameters v are given in Table [6l We 
see that the initial data has relatively small kurtosis, but after smoothing it decreases even further. 
The mean value of volatility after normalization is close to one. This confirms that a = v^/2n/3 is a 
good unbiased estimation of the daily volatility of rate returns. 

Table 6: Statistical parameters of EURUSD rate logarithmic returns for three different degrees of 
smoothness parameter v 
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0.1 


69.3 


v — 


4 


0.022 


0.993 


0.01 


0.1 


69.0 



Testing statistical significance of the decrease in kurtosis and the probability pi shows practically 
zero kurtosis of normalized returns (Fig. [29]). The corresponding histogram and normal probability 
graph are plotted in Fig. [30l As a result we receive a virtually canonical normal distribution with 
deviations that are rather typical for a relatively small sample (n = 2495). 

We shall not conduct a more detailed statistical analysis of distribution form, limiting the argumen- 
tation to these illustrative examples. We infer (see Conclusion) that the observed data is composed 
of the mixture of normally distributed fluctuations of the market, modulated with non-stationary 
volatility, and rare shock impacts. Therefore, even after the elimination of non-stationarity there may 
remain shock outliers, which make the total distribution weakly non-Gaussian. 
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Figure 29: Kurtosis ex and probability pi for different u, EURUSD, 1999-2008 

- 3 -2 -1 1 2 3 , 




Figure 30: Distribution of returns EURUSD, 1999-2008, after normalization 



11 Quasi-stationarity of volatility 

Vanishing autocorrelation coefficients between consecutive values of volatility, generally speaking, do 
not exclude the possibility of its stochastic description. In particular, we can write down the following 
simple discrete process: 

r t =afv t , a t =cr- (1+0 'fit), (27) 

where z/j and //, are independent random variables, while a, are constants. However, within this 
model the interpretation of volatility cr t as a random variable becomes rather superfluous. In fact, we 
come back to the usual stationary model r t = ere*, where Et = vt + ■ Ht^t- In particular, if z/j and 
fii are normally distributed, the distribution for Ei would no more be normal with kurtosis equal to 
60 2 (2 + 2 )/(l +0 2 ) 2 . Nevertheless, the question of local stationarity of 'true' volatility remains open. 

Let us conduct several statistical estimations. First, we consider a modified amplitude of range. 
The spread of its values under constant volatility a occurs due to finite width of distribution density 
P(v). One can obtain its analytical form from the equation (| A5[) of | Appendix A\ and present it as 
the following infinite series: 

, s , a , , s ^f4(2fc-l) 2 8k 2 (l + k 2 -4(k 4 -k 2 )v 2 ) ) 
P(v) = (32,4 _ 9)N{2v) + g | ^-^N, 1 (fc2 J 1)2 j N 2 | , 

where Ni = N(2(2fc — l)v), N 2 = ~N(2kv) are non-normalized Gaussian functions (see | Appendix A[ ). 
Below we list the integral probabilities of the fact that variable a — vsfhtjZ falls within the interval 
[0..cto] (the first line contains values of ctq, the second, corresponding probabilities measured in percent 
points): 

"(15 K& 07f 08 09 O O L2 L3 1A L5 TIT 
0.6 3.3 10.4 22.5 37.8 53.7 68.1 79.5 87.7 93.1 96.4 98.2 



The modified price range v\/2n/3 should remain within the interval [0 .. 1.5] about 96.4% of days; it 
very rarely drops below 0.5. 

If we eliminate (l26l by smoothing procedure with v = 4 the non-stationarity in daily modified 
ranges for EURUSD in 2007-2008 years, the residual series has dynamics as shown in Fig. 1311 
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Figure 31: Stationary price range of EURUSD in 2007-2008 

In case of Brownian random walk, dotted lines correspond to the probability 96% of staying within 
the interval 0.5 < v^/2n/3 < 1.5. We see that, except for rather rare outliers, most of daily volatilities 
estimated by modified amplitude of probability, fell into the dotted corridor. The number of outliers 
is slightly higher than expected 4% (as there is 250 trading days in a year, 250*4%=10). This small 
excess of extremal values may be interpreted (especially in 2008, a crisis year) as occasional shock 
impacts to the market, not related to its 'typical' intrinsic dynamics. 

As we have discussed above, the 'daily' volatility can be estimated not only by means of modified 
amplitudes of range, but also by calculating its value on the base on intraday lags, i.e. 15-minute ticks. 
In Fig. [32l the dynamics of volatility is presented, after the elimination of non-stationarity, obtained 
by the latter method for the period of 2007-2008 for EUR/USD exchange rate. In this case the spread 
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Figure 32: Intraday volatility of EURUSD in 2007-2008 

of values is related to the finiteness of the sample that is used for volatility calculations. In order to 
determine the significance level, one has to know the corresponding distribution of probability. As we 
know, the error of stationary volatility calculation is determined by fourth moments and, in case of 
large kurtosis, it will be quite large. 

The intraday 15-minute data distribution has significant kurtosis. Straightforward computation 
of kurtosis for EURUSD during 2004-2008 yields the value of 20, which is due to the long-term non- 
stationarity, the substantial cyclic effects in intraday activity, as well as several other specific reasons 
(see [30] , [32j for a detailed discussion), into which we will not delve here. 

In order to obtain the significance levels, we will conduct the following simple experiment with 
the data. Let us calculate the logarithmic return basing on 15-minute lags of EUR/USD rate. Then, 
to preserve the intraday periodicity, we shall mix data points with the same intraday time. In other 
words, we randomly shuffle all lags at 00:00, then apart from them we mix lags of 00:15, etc. For 
these synthetic data, that are free of any memory effects except the intraday cycles, we calculate the 
meaning of intraday volatility. Then we normalize the series so that the mean is equal to 1, and 
plot the corresponding distribution of probability. It turns out that about 96% of data stays within 
the 1 ± 0.4 corridor. It is these levels, which characterize the 'typical' range of volatility due to the 
finiteness of data, that are marked at the above chart with dashed lines. We see that the data fits 
into the corridor quite well. 

We stress that the computations performed above are rather a qualitative estimation than a strict 
statistical analysis; such analysis might not be appropriate at all without constructing a complete 
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model of non-stationarity of data at different time scales. However, an assumption about the local 
constancy of volatility of daily lag returns appears rather plausible. In other words, the daily volatility 
of the market most probably is described by a smooth, rather slowly varying function of time. At any 
moment, its value can be considered locally constant, and it determines the stochastic dynamics of 
price returns for a financial instrument. 

12 Conclusion 

Let us reiterate the main inference that we argued for in this paper: 

Volatility and other statistical parameters should be regarded as gradually changing func- 
tions of time. They determine locally quasi-stationary stochastic dynamics of prices for fi- 
nancial instruments. There are rare and irregular shock impacts that influence the markets, 
resulting in shifts in daily returns, and as they accumulate, affect the value of long-term 
volatility. 

The situation resembles the deformation of a plastic material after a series of impacts, and the 
gradual restoration of form after external influence is terminated. The study of properties of such 
resilience of volatility are of great importance, especially for forecasting the time of its reversal to the 
long-term typical levels. 

Therefore, the stochastic nature of markets is determined by the following two components: 1) 
intrinsically Gaussian-distributed daily returns with slowly changing volatility; and 2) rarely occurring 
shock impacts. These shocks are assumed to be essentially unpredictable, but their impact on the 
volatility as well as its subsequent evolution should be the subject of research. 

Actually, shocks are quite inconspicuous; in reality it is quite difficult to separate the 'unnatural' 
behavior of the market as a result of shocks from 'normal' volatility. Financial analysts and economic 
commentators never fail to find the piece of news to account for all price spikes and crashes. On the 
other hand, such events as Lehman's bankruptcy can hardly be considered everyday news. 

Volatility can also gradually increase as a result of relatively insignificant negative news back- 
ground, provided that such background lasts for long enough. Thus, a gradual increase in volatility 
since the beginning of 2007 was a result of precisely such 'soft' pressure on the markets from the real 
estate sector. Since the autumn of 2008, this growth has been explosive and unprecedented for the 
modern history of financial markets. As we know, it originated from financial sector, and triggered an 
avalanche-like effect of confidence crisis and widespread panic. All this, eventually, delivered a blow 
to the real sector of economy. 

Finally, an increase in volatility usually accompanies 'unmotivated' booms in the market, when 
a financial bubble starts to inflate. High volatility also persists in the period of its collapse. When 
market goes into a 'quiet' phase of growth, volatility usually slowly decreases. 

Peaks typically observed in the charts of non-stationary volatility bring up the analogy with res- 
onance phenomena in physics. Such connection implies the existence of certain equations describing 
the system dynamics. There is no doubt that a relaxation mechanism exists, ensuring that a decay of 
system excitations happens after a certain period, determined by the life time of the resonance. 

When one speaks about a gradual course of change in volatility, one should keep in mind that it 
refers to the 'typical' long-term market situations. Sometimes, however, jump- like changes in statistical 
parameters occur, which determine the stochastic dynamics of the price process. It seems plausible 
that such a qualitative shift in market behavior happened in September 2008. In contrast, the exit 
from this instability, and return to equilibrium, is likely to be quite gradual and prolonged. 

We infer that the non-Gaussian nature of markets stems from two origins. First, it is the artifact of 
uncritical postulation of stationarity under conditions when it doesn't really exist. This component can 
be removed, at least in theory. After the data is transformed into a stationary form, the non-Gaussian 
features reduce significantly. However, the rare shock impacts, which are the second origin, even when 
combined with stationary Gaussian returns, still render the distribution weakly non-Gaussian. This 
is particularly evident in the case of stock market, which has the after-hours periods when negative 
or positive news accumulate. When the markets open, a possibility appears of a 'single emission' of 
accumulated emotions. Around-the-clock foreign exchange markets can respond to the development 
of such shocks in more 'subdued' way. 
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Autocorrelation coefficients in various volatility measures also arise due to the non-stationarity of 
the data and disappear after it is eliminated. In this sense, they are indeed the evidence of long- 
term memory, but do not have anything to do with the short-term stochastic properties of volatility, 
which are assumed in corresponding autoregressive models. Therefore, further research should focus 
on forecasting the smooth dynamics of volatility, rather the stochastic theories of volatility behavior. 
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Appendix A Brownian walk 



In this appendix we provide the basic expressions for Brownian motion described by the stochastic 
equation dx = (xdt + crSW. Let us first consider the case of driftless process (fi = 0). Without any 
loss of generality, we may assume that at the initial moment of time x(0) = 0. The maximal and 
minimal values of x for the period ^ t ^ T are equal to H and L, respectively, and r — x(T). The 
height h = H of ascent and the depth I — —L of descent are always positive, and —l^r^h. The 
amplitude of range is equal to a = h + I. Below we consider the case of unit volatility a = 1 and unit 
time interval T = 1. To restore the original notation, it is necessary to substitute r — + rjayT for the 
dimensionfull variables r, h, I, a. The same should be done in the differentials dr, etc. in the integrals 
containing the probability densities. In order to make the formulae more concise, we use this notation 
for the normal distribution function: N(x) = e~ x ^ 2 /V2tt. 

• We start with the relation for probability that x does not rise above h and does not fall below 
—I, when the closing return is r: 

oo 

p(-l < L,H <h,r)= |N(r + 2fca) - N(r + 2Z + 2fca) } . (Al) 



This formula was first received by [40(. We also note an exclusively useful reference book by [4l|. 
Distributions for other variables are derived from the probability 1|A1[) . For the return, height and 
depth we have: 

P{r) = N(r), P(h) = 2N(h), P(l) = 2N(Z). (A2) 

The density of probability for the range a is expressed in the form of an infinite series over Gaussian 
basis: 

oo 

P(a) = 8^(-l) fe+1 -fc 2 -N(fca). (A3) 

fe=l 

This series converges rather quickly for all a ^ 0. A characteristic property of Feller's distribution 
P(a) is an extremely rapid decline in the density of probability for large values of a. Here is a sample 
of values of integral probabilities F(a) = p(H — L < a): 

a 0.750 1.000 1.500 2.000 2.500 3.000 3.500 4.000 
F(a) 0.002 0.063 0.487 0.819 0.950 0.989 0.998 1.000 

The a parameter is smaller than 0.75 (a = 1) only in 2 cases out of 1000. The mean value is a — 1.5958, 
the variance is a a — 0.29798 • a. The one sigma interval (a ± a a = [1.120 .. 2.071]) contains 71.6% 
of all a values, while the double sigma interval (a ± 2a a = [0.645 .. 2.547]) contains 95.6%; and data 
points outside of the latter interval should, in reality, occur only for a above the mean. 

The joint densities of probability for height (r ^ h), depth (— I < r) and range (|r| ^ a) have the 
following form: 

P(h,r) =2(2h-r) -N(2h-r), P(l, r) = 2(21 + r) ■ N(2Z + r). (A4) 

OO 

P{a,r)=A k-{-\r\-k(2k + 3)a + k-(a-\r\)(2ka+\r\) 2 }-N(\r\+2ka). (A5) 

k — ~ OO 

Note also that P(a, -r) = P(a, r), and P(a, \r\) = 2P(a, r). 

• Let us provide a table of mean values for different variables (where v = a — |r|/2): 

F = 0, r* = l, ^" = 0, ^=3, 

2 P = l, h? = J-, F = 3, 

7T V 7T 

8 — — (2tt) 3 / 2 — 

a 2 = 41n2, a 3 = i-^ — , a 4 = 9-C[3], 

^ = 41n2--, ^3 = ^l±i ; ^ = 61n2- — + --C[3], 



2ir 4' 6V2^ 16 



25 



where £[n] = J2T=i ^ n ls a Rieman (^-function. The mean values for I and |r| are the same as for h. 
The means of certain cross-products are given below: 

3 



Jr n = (-l) n • hr n , ar 2n+1 =0, ar 2n = 2-hr 2n , a\r\ y 

Expressions for other mean values, as well as their generating function, can be found in 

For a process with a non-zero drift dx = \idt + aSW, we shall use the above-determined driftless 
densities. In order to restore time T and variance a, we should additionally substitute the shift as 
follows: [i — > /iT/aVT. The density of probability for returns is equal to: 

P M (r) = N(r - m) = e^-^l 2 P(r). 

Expressions for joint probability densities (4l| : 

P„(h, r) = er-^' 2 P(h, r), P„(Z, r) = e^^' 2 P(Z, r), 

P^a, r) = e ^ 2 /2 P( a , r), P^h, I, r) = e^^' 2 P(h, I, r). 

Thus, the densities corresponding to /i = are always multiplied by a factor e M '' _AI I 2 . In the presence 
of drift we obtain: 

r = /i, ^=l + fi 2 , ^=3^ + ^ 3 , r i = 3 + 6/u. 2 + ^ . 

Exact expressions for mean values of other variables are rather cumbersome. However, as for financial 
data the condition fx <C er = 1 holds, it is acceptable to decompose a factor e Mr ~ M I 2 into a series and 
to use means for the case /x = 0. As a result we receive: 

E=^+$ + -T=--^ + - R= + (A6) 

V 7T 2 3V2T 60V2T V 7T V2T 12V2T 

V 7T 2 3V2^ 60%/2^ V 7T 3V2^ 30^2^ 

The mean values of height and depth are linear in fi, and only even powers of /x are present in the tail 
of expansion. The means of lag range and absolute returns contain only even powers of \i. Note also 
the following simple relations, available in closed form: 

h-l = r = fx, h 2 +l 2 = 2 + ^ 2 , hr=W-l/2, 77 = 1/2 - P. 



Appendix B Measures of volatility 

The width of probability distribution of a positive random variable z > can be characterized with a 
relative error cr z /z, where a z as usual denotes the standard deviation a 2 = [z — z) 2 . 

Note that the relative width of distributions for z and z 2 are different, and thus actually there 
are different criteria for optimality of volatility measurement. For example, in order to calculate the 
stationary volatility one usually uses averaging of either squared returns, or the squares of the lag 
ranges [2J|: 

1 " 1 ™ 2 

4=1 t=l 

As in this paper we examine the non-stationary nature of volatility and use the non-linear HP-filter 
for smoothing, it is more convenient to average volatilities a proper, rather than their squares; the 
latter, as we will see below, yield a biased value of a for small n. Nevertheless, considering the various 
measures of volatility, we will calculate the relative width of both the value its square. 
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Let us recite some well-known volatility estimators. We shall use a Parkinson measure (1980) [24| 
as a base; it is equal to the amplitude of range vp = a. Garman and Klass (1980) [25], working in the 
class of analytic functions of h, I, r, proposed the following optimal combination, which is a better 
measure than that of Parkinson: 

v 2 GK = 0.511 • a 2 - 0.019(r ■ (h - I) + 2h ■ l) - 0.383 • r 2 . (B9) 

A simpler and drift-independent \i measure is suggested by Rogers and Satchell (1991) [26| : 

v\ s = h- (h- r) + 1 ■ (l + r). (BIO) 

Let us show that the simplest linear modification of Parkinson's measure 

vp=a-0-\r\ (Bll) 

where (3 > is a constant, leads to a narrower distribution than the amplitude of range. If we use 
relative volatility <j v as a criterion of narrowness, it is not difficult to find the optimal value of the 
coefficient f3 using the means from the Appendix A: 



(a — Q. | r h2 

— H _ 9 = min => /3 = 6 - 8 In 2 » 0.455. (B12) 

(a-P-\r\f 

However, <r v /v is not the only criterion, and due to the low sensitivity of the relative volatility to 
change in /?, we use in this paper the value P = 1/2 and notation v = a — |r|/2. In what follows we 
denote vp — a — 0.455 • |r|. 



We note that there is another simple measure of volatility, comparable in its effectiveness to (IB11|) , 
namely: 

v F = %- -. (B13) 

l + r 2 /a 2 v ' 

Although the probability of zero value a for finite duration of a lag T is vanishingly small, it is still 
necessary to define the corresponding value vf = for a = 0. Actually, the relations (IB11|) and l|B13l) 
are not analytic functions on a and r, and thus are not governed by the lemma from Appendix B of 

• In addition to the width of distribution, sometimes absent or weak dependence on the drift /i 
are used as a criterion. Note that for daily, or shorter, lags [i <C a; therefore, this criterion is not that 
significant. The above proposed measure of the modified lag range, as well as the price range itself, 
depends on \x. However, this dependence is significantly weaker for v than for the amplitude a. If we 
use the presentations l|A6j) . (|A7[) . we can write the following expression for vp: 

, f -p-fii.Jl + 2=W_£dV>£ + ._ (B14) 

V 7T 3^2^ 60^2^ 

It can be seen that the factor beside [i 2 for P = 1/2 is four times smaller than for P = (vp = a). 
Consequently, the dependence on [i is four times weaker as well. When p = 2/3 (denoted u 2 /3 below) 
the coefficient at /x 2 becomes equal to zero, and the dependence on fx is weakening still, although it 
disappears completely only for the measure by Rogers and Satchell. 

• Let us now compare the statistical parameters of different volatility measures shown in Table [3 



Table 7: Statistical parameters of different volatility estimators, derived analytically (upright) and 
numerically (italic) 
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ex 
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<j v /v 


<T V 2/V 2 


Vp 


1.596 


2.773 


0.476 


0.97 


1.24 


70.6 


0.298 


0.638 


VRS 


0.960 


0.998 


0.275 


0.46 


0.42 


69.5 


0.287 


0.576 


V2/3 


1.064 


1.217 


0.292 


0.52 


0.29 


68.4 


0.275 


0.557 


VGK 


0.968 


0.998 


0.245 


0.60 


0.39 


68.6 


0.253 


0.519 


V F 


1.254 


1.673 


0.316 


0.53 


0.28 


68.4 


0.252 


0.513 


V 


1.197 


1.523 


0.300 


0.53 


0.26 


68.2 


0.251 


0.511 


Vp 


1.233 


1.615 


0.308 


0.55 


0.29 


68.3 


0.250 


0.510 
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We use italic font to mark the values obtained by Monte Carlo simulation for 3.5 million lags, each 
being a random walk of 1 million ticks. In this case, for means and volatility an error of order of 
±0.002 is possible in the last significant digit. The other values (in upright font) are derived through 
analytical calculations. 

• For non-stationary data it is often necessary to conduct 
the averaging over a relatively small number of observations 
n. In this bias becomes apparent in quadratic mea- 

sures for volatility a. Even if one calculates the classical 
squared volatility a\ by means of the unbiased formula l|B8p , 
the value <7r will be biased; indeed, when averaging over 
large numbers of samples of size n, we have < a 2 R >= a 2 , but 
< \fo%. >^= a. If we are interested in the value of volatility 
itself rather than its square, it is better to use linear rather 
than quadratic measures for non-stationary data. 

To illustrate the effect of drift we provide charts of mean 
values of volatility (Fig. [331 , obtained by averaging a large 
number of samples of n values each, for standard definition 




Figure 33: Mean values of volatility for 
different sample sizes n for several esti- 
mators. 



of an and ana — y Vrg measure l|B9l) compared to a linear 
measure of a = (a — \r\/2)^/2n/3. 

Thus, the measure v = a — \r\/2 has a relatively narrow distribution and consecutively results in 
smaller error in volatility measurement. Simplicity is its obvious advantage, as compared with the 
measures w^s and vqk- In addition, it is unbiased in case of small sample size, which is significant in 
examining the effects of non-stationarity. 
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